We investigate theoretically the behavior of proteins as well as other large macromolecules which
are incorporated into amphiphilic monolayers at the air-water interface. We assume the monolayer
to be in the coexistence region of the "main" transition, where domains of the liquid condensed
phase coexist with the liquid expanded background. Using a simple mean-field free energy accounting
for the interactions between proteins and amphiphilic molecules, we obtain the spatial protein
distribution with the following characteristics. When the proteins preferentially interact with either
the liquid condensed or liquid expanded domains, they will be dissolved in the respective phase.
When the proteins are energetically rather indifferent to the density of the amphiphiles, they will
be localized at the line boundary between the (two-dimensional) liquid expanded and condensed
phases. In between these two limiting cases, a delocalization transition of the proteins takes place.
This transition is accessible by changing the temperature or the amount of incorporated protein.
These findings are in agreement with recent fluorescence microscopy experiments. Our results also
apply to lipid multicomponent membranes showing coexistence of distinct fluid phases.

I. INTRODUCTION

Monolayers of amphiphilic molecules spread on liquid surfaces have traditionally been studied as models for
biological membranes. Such insoluble and monomolecular films made of suitable phospholipids or fatty acids are
stable over a wide range of surface pressures and temperatures due to the strong reduction of the water surface tension
and are called Langmuir monolayers. In typical experiments, the amphiphiles are solubilized in a volatile solvent
and placed on the air-water interface. As the solvent evaporates, the amphiphiles spontaneously spread and form a
monolayer. When the insoluble film is then compressed (while keeping the temperature fixed), the lateral pressure
can be measured as a function of the area per amphiphilic molecule in analogy to bulk isotherms.

Using film balance techniques, the following general picture emerged. When the extremely expanded
film is compressed, it produces the liquid expanded phase (LE), which, at low enough temperatures, transforms upon
further compression into the liquid condensed phase (LC). At much lower surface concentrations and at low enough
temperatures, the monolayer undergoes a first-order transition into a gaseous phase. At very high lateral pressures,
solidification occurs, as indicated by a discontinuity in the pressure-area isotherms. Subsequently, these systems were
also studied using X-ray and neutron scattering techniques, indicating the existence of a large number
of different condensed phases. In this paper we will be concerned only with the LE/LC transition. Therefore, we do
not introduce appropriate order parameters needed to distinguish the different condensed phases [11].

The nature of the LE/LC transition has been the subject of much discussion [12]. It is analogous to the "main"
transition in lipid bilayers, where domains of the LE and the LC phase have been shown to coexist over a wide
range of lipid surface concentrations (or area per molecule). In this coexistence region, the condensed domains show
a large variety of different shapes [13] and grow as the area per molecule is decreased, whereas the number of domains
depends on the initial conditions and typically stays fixed. The isotherms in the coexistence region, however, were
found to be non-horizontal, which led to the postulation of a limited cooperativity of this transition. For the
case of single-chain fatty acids, it was later shown that the isotherms approach zero slope as the material used is
progressively purified [12].

On the theoretical side, the LE/LC transition has been modeled based on various microscopic pictures of the
interaction between surfactant (or lipid) molecules including translational as well as internal degrees of freedom.

The biological function of membranes depends mostly on the incorporation of proteins and other macromolecules
into the lipid layers. Functionality and efficiency of these inclusions depend crucially on microscopic details of the
embedding in the lipid matrix, which can occur in different ways. Monolayers at the air-water interface are suitable
for the study of the interaction between lipids and proteins, since they are rather well-defined and allow the control of
independent thermodynamic parameters which are otherwise fixed in a bilayer membrane, like the area per molecule.
Also, the observational techniques are well developed. Direct visualization of the phase behavior of monolayers can be
obtained using fluorescence microscopy techniques. Here, a fluorescent dye probe is incorporated into the monolayer,
and its lateral distribution is obtained from the analysis of fluorescence micrographs. Contrast in the images
arises from differences in dye solubility, fluorescence quantum yield, or molecular density between the coexisting phases
[19]. A complementary and recently developed technique is Brewster-angle microscopy, which allows imaging of a
monolayer without the addition of fluorescent probes [20].

After injection of a water-soluble protein into the aqueous subphase, the surface tension typically decreases,
indicating that the protein is at least partially incorporated into the monolayer [22,23]. This is due to the protein
affinity to the water/air interface. The specific type of this attraction is not well understood and probably is due in
part to structural changes (denaturation) of the protein in the monolayer or at the water surface, associated with the
unfolding of hydrophobic groups.

One of the striking experimental observations [22], [24] was that some proteins adsorb preferentially along the boundary
line between the LE and LC domains when the monolayer is in the LE/LC coexistence region. These observations
were made for fluorescently labeled small proteins, such as concanavalin A [24] or streptavidin [22], interacting with
phospholipid monolayers. These experimental findings motivated our present theoretical study.

In the following, we describe a simple model, which (i) assumes the LE/LC transition to be a simple first-order 
condensation transition, yielding coexisting domains for temperatures below the critical temperature, and (ii) includes 
the effect of proteins which are adsorbed into the monolayer. Assuming that the proteins are completely incorporated 
into the monolayer, this simplistic model leads to an entropic force which tends to localize the protein at the boundary 
between LE and LC domains. Depending on the energetic preference of the protein for the LE or LC phase, the protein 
will be either dissolved in the LE or in the LC domain, or, if there is no pronounced preference, will be localized at 
the boundary. 

Phase separation in amphiphilic layers is also observed for freely suspended multicomponent bilayers. Here,
the coexisting phases are distinguished by their compositions. The most important examples include mixtures of
phospholipids with cholesterol [25] and mixtures of different phospholipids [26], and in both cases the coexisting
phases are in a fluid state. These phenomena are of great biological interest since biological membranes are always
multicomponent mixtures and lateral organization into domains is supposed to play an important functional role. We
note that our results apply directly to these situations as well, although we will limit our terminology to the situation
of coexisting dense and dilute phases for one-component systems at the air-water interface. For the case of freely
suspended membranes, our findings imply a simple mechanism for the localization of integral membrane proteins
along the one-dimensional boundary between coexisting domains. The resulting enrichment of proteins might be a
prerequisite for proper biological function in certain cases.

In the following sections we formulate the model (Sect. II), inspect the minima of the free energy (Sect. III), solve
the corresponding Euler-Lagrange equations in the coexistence region (Sect. IV), and calculate profiles both for the
lipid and the (coupled) protein densities (Sect. V). From the profiles we generate a general phase diagram featuring
localized, semi-localized and delocalized protein phases. We also calculate the total amount of adsorbed protein, the
protein excess $\Gamma$ (Sect. VI), and the line tension $\tau$ of the LE-LC line interface (Sect. VII). It turns out that the
line tension is strongly reduced by the adsorption of proteins. A finite solubility of the proteins in the subphase is
taken into account in Sect. VIII. Finally, the connection to experimentally measurable quantities, such as the surface
pressure $\Pi$, is made in Sect. IX.



II. THE MIXED LIPID AND PROTEIN FREE ENERGY 



Consider the air-water interface with proteins, lipid molecules, and artificial "vacancies", with area fractions $\phi_P$,
$\phi_L$, and $\phi_V$, respectively, satisfying $\phi_P + \phi_L + \phi_V = 1$. The vacancies are introduced in order to allow for independent
variations of the protein and lipid concentrations, hence making coexistence of dilute and condensed regions of the
monolayer possible. Inscribing the system on a lattice, with a lattice constant corresponding to the size of a lipid
molecule, the free energy of mixing per lattice site within a mean field theory can be written for the three-component
mixture as a sum of the enthalpy and entropy of mixing, $F = U - TS$. The enthalpy of mixing includes all pair-wise
interactions between the three species:

$$U/T = E_{LL}\phi_L^2 + E_{VV}\phi_V^2 + E_{PP}\phi_P^2 + E_{LV}\phi_L\phi_V + E_{PL}\phi_P\phi_L + E_{PV}\phi_P\phi_V \tag{1}$$

and the $E_{ij}$ are the dimensionless interaction parameters for all possible pairs. The entropy of mixing is related to
the total number $\Omega$ of distinct microscopic configurations by

$$S = \frac{1}{N}\log\Omega \tag{2}$$

where $N$ is the total number of lattice sites and the Boltzmann constant is set to unity ($k_B = 1$). In the random-mixing
approximation,

$$\Omega = \frac{(N/\alpha)!}{(N\phi_P/\alpha)!\,(N(1-\phi_P)/\alpha)!}\;\frac{(N(1-\phi_P))!}{(N\phi_L)!\,(N\phi_V)!} \tag{3}$$

where the constant $\alpha > 1$ denotes the ratio between the compact area occupied by a protein molecule and a lipid
molecule at the interface. The above expression is the product of the number of all protein configurations and the
number of all lipid/vacancy configurations in the remaining area not taken up by the proteins. Using Stirling's
formula in the thermodynamic limit, defined by $N \to \infty$, the expression for $S$ can be simplified to

$$S = -\phi_L\log(\phi_L) - \phi_V\log(\phi_V) - \phi_P\log(\phi_P)/\alpha - (1/\alpha - 1)(1-\phi_P)\log(1-\phi_P) \tag{4}$$
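The step from the configuration count to the Stirling form (4) can be checked numerically. The following is a minimal sketch, assuming the entropy is normalized per lattice site, $S = \log\Omega/N$, and that the configuration count is the product of a protein factor and a lipid/vacancy factor as described in the text. The function names and the sample values of $N$, $\phi_P$, $\phi_L$, and $\alpha$ are illustrative choices, not taken from the paper.

```python
import math


def log_factorial(x):
    """log(x!) for real x >= 0, via the log-gamma function."""
    return math.lgamma(x + 1.0)


def log_omega(n_sites, phi_p, phi_l, alpha):
    """Logarithm of the random-mixing configuration count.

    Assumed form: (protein placements) x (lipid/vacancy placements on the
    area not covered by proteins), with alpha the protein/lipid area ratio.
    """
    phi_v = 1.0 - phi_p - phi_l
    proteins = (log_factorial(n_sites / alpha)
                - log_factorial(n_sites * phi_p / alpha)
                - log_factorial(n_sites * (1.0 - phi_p) / alpha))
    lipids = (log_factorial(n_sites * (1.0 - phi_p))
              - log_factorial(n_sites * phi_l)
              - log_factorial(n_sites * phi_v))
    return proteins + lipids


def entropy_stirling(phi_p, phi_l, alpha):
    """Entropy per lattice site in the Stirling limit, expression (4)."""
    phi_v = 1.0 - phi_p - phi_l
    return (-phi_l * math.log(phi_l)
            - phi_v * math.log(phi_v)
            - phi_p * math.log(phi_p) / alpha
            - (1.0 / alpha - 1.0) * (1.0 - phi_p) * math.log(1.0 - phi_p))


if __name__ == "__main__":
    n_sites = 1.0e6  # illustrative lattice size
    exact = log_omega(n_sites, 0.1, 0.5, 50.0) / n_sites
    approx = entropy_stirling(0.1, 0.5, 50.0)
    print(exact, approx, exact - approx)
```

The difference between the two values is of order $\log N / N$, which is the size of the terms dropped by Stirling's formula.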

It is convenient to define the thermodynamic potential

$$G/T = F/T - \mu_P\phi_P - \mu_L(\phi_L - \phi_V) \tag{5}$$

where the chemical potentials $\mu_P$ and $\mu_L$ are coupled to the protein concentration $\phi_P$ and the difference between the
lipid and vacancy concentrations, $\phi_L - \phi_V$, respectively.

In (1)-(5), long-range interactions between the proteins, such as electrostatic forces, are not taken into account. In
addition, the free energy of mixing assumes a confinement of the protein and lipid to the two-dimensional plane of the
air-water interface. In fact, the variation of the protein concentration perpendicular to the monolayer in the subphase
can be taken into account approximately and leads to a renormalization of the parameters of the two-dimensional
model, as shown in Sect. VIII.

The lipid order parameter $\eta$, corresponding to the density of lipid molecules, can be written as

$$\eta = \phi_L - \phi_V \tag{6}$$

Using that $\phi_P + \phi_L + \phi_V = 1$, and defining the protein concentration as $\phi = \phi_P$, the free energy $F$ and the potential
$G$ can be rewritten as

$$\begin{aligned} F/T = {} & -(J + 1/2)\eta^2 + L\phi^2 + \Lambda\eta\phi \\ & + (1+\eta-\phi)\log[(1+\eta-\phi)/2]/2 + (1-\eta-\phi)\log[(1-\eta-\phi)/2]/2 \\ & + \phi\log[\phi]/\alpha + (1/\alpha - 1)(1-\phi)\log[1-\phi] \end{aligned} \tag{7}$$

and

$$G/T = F/T - \mu_\eta\eta - (\mu + \log 2)\phi \tag{8}$$

where constant terms have been omitted and linear terms in $\eta$ and $\phi$ have been dropped from $F$ for convenience.
They merely contribute a constant shift to $\mu$ and $\mu_\eta$ in $G$. The reduced interaction parameters $J$, $L$, $\mu$, and $\Lambda$ are
related to the original $E_{ij}$ and $\mu_P$ in the following way:

$$-J = \frac{1}{4}(E_{LL} + E_{VV} - E_{LV}) + \frac{1}{2} \tag{9}$$

$$L = \frac{1}{4}(E_{LL} + E_{VV} + E_{LV}) + E_{PP} - \frac{1}{2}(E_{PL} + E_{PV}) \tag{10}$$

$$\Lambda = -\frac{1}{2}(E_{LL} - E_{VV} - E_{PL} + E_{PV}) \tag{11}$$

$$\mu = \mu_P + \frac{1}{2}(E_{LL} + E_{VV} + E_{LV} - E_{PL} - E_{PV}) - \log 2 \tag{12}$$

The constant $\log 2$ appears in the definition of $\mu$ in order to render the simplified expression (13) in a simpler form.
The above expression for $G$ is studied in Sect. III for different values of the various parameters and the corresponding
bulk phase-diagrams are obtained. For the study of protein profiles, one can further simplify this expression. First,
for small values of the order parameters, i.e., relatively close to the critical point of demixing of the lipid and for small
protein concentrations, it is legitimate to expand the free energy of mixing up to order $O(\eta^4)$ and $O(\phi^2)$. In addition,
since typical proteins occupy a much larger area than lipids, the area ratio is in the range of $\alpha \simeq 50 - 100$, and the
protein entropy terms (of order $1/\alpha$) can be neglected in (7). The validity of the latter ($\alpha \to \infty$) approximation will
be reexamined in Sect. III [27]. With these simplifications, the approximated free energy density can be written as

$$F^0/T = -J\eta^2 + \frac{\eta^4}{12} + L\phi^2 + \Lambda\eta\phi + \frac{1}{2}\eta^2\phi \tag{13}$$

where the simplified thermodynamic potential using (8) is given by $G^0/T = F^0/T - \mu\phi - \mu_\eta\eta$. The free energy density
(13) needs some further discussion. Coexistence between dense ($\eta > 0$) and dilute regions ($\eta < 0$) requires that $J > 0$,
and a positive fourth-order term $\sim \eta^4$ is needed to stabilize the free energy. The protein itself is assumed not to be
close to any phase transition. Hence $L > 0$ and no higher order terms in $\phi$ are needed. We include in the expansion
only the two lowest coupling terms between the protein and lipid concentrations. The first is the bilinear coupling
$\eta\phi$ and has an enthalpic origin. It reflects the overall preference of the protein for more condensed ($\Lambda < 0$) or more
dilute ($\Lambda > 0$) regions of the lipid monolayer. The second coupling is the symmetric $\eta^2\phi$ term, which is invariant
under the transformation $\eta \to -\eta$ and provides the driving force for the localization of proteins at the LE-LC interface.
In our mean-field model, taking into account only pair interactions, this coupling has a purely entropic origin. More
generally, it can also include interaction terms of higher order in a virial expansion. Finally, the higher-order coupling
terms $\eta^2\phi^2$ and $\eta^4\phi$ are not considered here since we try to investigate the simplest yet non-trivial type of
coupling. A similar free energy coupling has been introduced in the context of polymer adsorption at liquid-liquid
interfaces, where in analogy the polymer adsorbs preferentially at the interface from the bulk solution [28].
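The relation between the full free energy (7) and the simplified form (13) can be illustrated numerically. The sketch below is an illustration under stated assumptions rather than part of the original derivation: it takes the $\alpha \to \infty$ limit of (7) by passing `alpha = math.inf`, and it assumes that the only terms separating the two expressions at low order are the constant $-\log 2$ and the linear term $\phi\log 2$, which are the terms absorbed into the definition of $\mu$. Function names and parameter values are hypothetical.

```python
import math


def free_energy_full(eta, phi, J, L, lam, alpha=math.inf):
    """Mean-field free energy F/T of expression (7).

    With alpha = math.inf the protein entropy terms of order 1/alpha vanish.
    """
    plus = 1.0 + eta - phi
    minus = 1.0 - eta - phi
    lipid_entropy = (0.5 * plus * math.log(plus / 2.0)
                     + 0.5 * minus * math.log(minus / 2.0))
    protein_entropy = (phi * math.log(phi) / alpha
                       + (1.0 / alpha - 1.0) * (1.0 - phi) * math.log(1.0 - phi))
    return (-(J + 0.5) * eta ** 2 + L * phi ** 2 + lam * eta * phi
            + lipid_entropy + protein_entropy)


def free_energy_simplified(eta, phi, J, L, lam):
    """Expanded free energy F0/T of expression (13)."""
    return (-J * eta ** 2 + eta ** 4 / 12.0 + L * phi ** 2
            + lam * eta * phi + 0.5 * eta ** 2 * phi)


def expansion_residual(eta, phi, J, L, lam):
    """Full minus simplified free energy, after removing the constant and
    linear-in-phi pieces (phi - 1) * log 2 that are absorbed elsewhere."""
    offset = (phi - 1.0) * math.log(2.0)
    return (free_energy_full(eta, phi, J, L, lam)
            - free_energy_simplified(eta, phi, J, L, lam) - offset)


if __name__ == "__main__":
    for eta, phi in [(0.2, 0.02), (0.1, 0.01), (0.05, 0.005)]:
        print(eta, phi, expansion_residual(eta, phi, J=0.1, L=10.0, lam=0.3))
```

The residual shrinks rapidly as $\eta$ and $\phi$ decrease, consistent with the neglected terms being of higher order in both variables.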

For the case where the proteins in the monolayer are in equilibrium with a solution of proteins in the aqueous
subphase, the protein chemical potential $\mu$ corresponds to the free energy of adsorbing proteins from the subphase
into the monolayer and depends on the concentration of proteins in the subphase; this is discussed in Sect. VIII. Since
we consider an insoluble (Langmuir) monolayer, similar considerations do not apply to the chemical potential $\mu_\eta$ of
the lipid order parameter $\eta$. In fact, $\mu_\eta$ will be uniquely determined by the requirement of coexistence between dense
and dilute lipid regions. For proteins which are insoluble in the subphase, the chemical potential $\mu$ acts as a Lagrange
multiplier fixing the total amount of protein in the monolayer, which is a conserved quantity in this situation.

In the LE/LC two-phase region, obtained for $J > 0$, one finds experimentally domains of typically circular shape
of LC phase immersed in a background of LE phase. Since the domains are rather large ($\sim 10 - 100\,\mu$m), we neglect
the shape of the line boundary between the LC and LE regions and assume variation of the lipid concentration only
along one spatial direction (the $x$ direction) and translational invariance along the perpendicular direction. The free
energy $\gamma$ per unit length of this line boundary (related to the line tension $\tau$ of the interface as calculated in Sect. VII)
is given by

$$\gamma = \int_{-\infty}^{\infty} \mathcal{L}\,dx \tag{14}$$

where the free energy density $\mathcal{L}$ includes contributions associated with spatial variations of the concentrations. Defining
the "stiffness coefficients" $g_\phi$ and $g_\eta$ for the protein and lipid concentration profiles, respectively, the free energy density
$\mathcal{L}$ is given by

$$\mathcal{L} = \frac{G}{T} + \frac{g_\phi}{2}\left(\frac{d\phi}{dx}\right)^2 + \frac{g_\eta}{2}\left(\frac{d\eta}{dx}\right)^2 \tag{15}$$

In the next section we study the bulk phase diagram based on the thermodynamic potential (8). In the subsequent
sections we use the simplified expression (13) and determine the concentration profiles $\phi(x)$ and $\eta(x)$ by applying a
variational principle to the free energy functional $\gamma$.



III. THE PHASE DIAGRAM 



The phase diagram as a function of the chemical potentials $\mu_\eta$ and $\mu$ can be obtained from the thermodynamic
potential (8) by minimizing $G$ with respect to the order parameters $\eta$ and $\phi$ in the two-phase coexistence region [29].
The coexisting solutions, denoted by $(\eta_1, \phi_1)$ and $(\eta_2, \phi_2)$, are determined from the equations

$$\mu_\eta = \frac{1}{T}\frac{\partial F}{\partial\eta}\bigg|_{(\eta_1,\phi_1)} = \frac{1}{T}\frac{\partial F}{\partial\eta}\bigg|_{(\eta_2,\phi_2)}, \qquad G(\eta_1,\phi_1) = G(\eta_2,\phi_2) \tag{16}$$

$$\mu + \log 2 = \frac{1}{T}\frac{\partial F}{\partial\phi}\bigg|_{(\eta_1,\phi_1)} = \frac{1}{T}\frac{\partial F}{\partial\phi}\bigg|_{(\eta_2,\phi_2)} \tag{17}$$

which correspond to a common-tangent construction. These equations can be easily solved numerically. In order to
estimate the role of the protein-lipid area ratio, $\alpha$, and to compare the results with the calculations presented in the
next section based on the simplified expression (13), where $\alpha \to \infty$, we restrict the numerical analysis to the values
$J = 1/10$ and $L = 10$. The small value of $J$ means that one is close to the critical point of the lipid phase separation,
and the expansion in powers of $\eta$, leading to (13), is appropriate. The large value of $L$ means that the protein
concentration is rather small everywhere and can be treated as a small perturbation. We will need this assumption
for the analytic solution of the Euler-Lagrange equations in Sect. IV. The parameter $\alpha$ will be scanned in a rather
wide range. With this choice of $L$ and $J$, it is clear that the simplified free energy expression (13) is asymptotically
obtained for $\alpha \to \infty$.

The protein concentrations in the coexisting dense and dilute lipid regions scan a whole range of different values,
depending on the values of the remaining parameters $\mu$ and $\Lambda$, but are strictly bounded below by $\sim \exp(-\alpha)$. In
contrast, the simplified free energy expression (13) has solutions with non-zero and strictly zero protein concentrations,
because of the $\alpha \to \infty$ limit. It therefore allows for straightforward classification of the bulk protein ordering into a
phase with finite protein concentration and a phase with no proteins at all. We need a similar criterion for the case
of the full free energy expression (7) with $\alpha$ finite, allowing us to distinguish in a categorical manner the presence
of proteins from the absence of proteins, even in the inevitable presence of an exponentially small (in $\alpha$) protein
concentration. We adopt the simple criterion which consists of calculating the Laplacian of the protein concentration
in the parameter space $(\mu, \Lambda)$,

$$\frac{\partial^2\phi_i}{\partial\mu^2} + \frac{\partial^2\phi_i}{\partial\Lambda^2} \tag{18}$$

in the two coexisting phases $\phi_1$ and $\phi_2$. This scalar quantity shows a pronounced line of maxima in the parameter
space, separating two phases with small and large concentrations of proteins. The position of this ridge is determined
numerically and defined as the boundary between the two phases rich and devoid of proteins, respectively, for each
solution $\phi_i$. The result of this operation leads to three distinct phase regions and is shown in Fig. 1 for the values
$\alpha = 10$, 50, and 200. Anticipating the definitions (28) and (30), we present the results in terms of the rescaled variables
$a = \mu/(3J)$ and $c = \Lambda/\sqrt{3J/2}$. The results obtained for $\alpha = \infty$ are denoted by solid lines. In the region denoted "no
proteins" both protein concentrations $\phi_1$ and $\phi_2$ are very small (exponentially in $-\alpha$); in the region "semi-localized"
only one concentration is small while the other is finite (distinguished by the criterion described above), and in the
region "delocalized" both phases have finite protein concentrations.

In the next section we will calculate the protein profile explicitly and, in addition, obtain a "localized" phase. This
phase cannot be distinguished from the "no protein" phase by just looking at the bulk free energy. In fact, in this
phase there is a finite protein concentration only at a finite distance from the boundary between the LE and LC
regions. As one can see from Fig. 1, the phase boundary for $\alpha = 50$ (long dashes) is already fairly close to the
asymptotic boundary ($\alpha \to \infty$, solid line), so that neglecting the protein entropy is already a good approximation for
moderately large macromolecules.

IV. EULER-LAGRANGE EQUATIONS 

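In this section we calculate the protein concentration profile based on the free energy expression (15). Minimization
of the line free energy $\gamma$ (14) leads to the Euler-Lagrange equations (denoting $d\phi/dx$ by $\phi'$, etc.)

$$\frac{\partial\mathcal{L}}{\partial\eta} - \frac{d}{dx}\frac{\partial\mathcal{L}}{\partial\eta'} = 0, \qquad \frac{\partial\mathcal{L}}{\partial\phi} - \frac{d}{dx}\frac{\partial\mathcal{L}}{\partial\phi'} = 0 \tag{19}$$

Using the full free energy of mixing (7), one obtains two coupled second-order and non-linear differential equations of
the form

$$\begin{aligned} -\mu_\eta - (2J+1)\eta + \Lambda\phi + \frac{1}{2}\log\left(\frac{1+\eta-\phi}{1-\eta-\phi}\right) &= g_\eta\frac{d^2\eta}{dx^2} \\ -\mu - \log 2 + 2L\phi + \Lambda\eta + \frac{1}{2}\log\left(\frac{4(1-\phi)^2}{(1-\phi)^2-\eta^2}\right) + \frac{1}{\alpha}\log\left(\frac{\phi}{1-\phi}\right) &= g_\phi\frac{d^2\phi}{dx^2} \end{aligned} \tag{20}$$

For the actual calculation of concentration profiles, we will use the simplified free energy expression (13), leading to
the more compact expressions

$$-\mu_\eta - 2J\eta + \frac{\eta^3}{3} + \Lambda\phi + \eta\phi = g_\eta\frac{d^2\eta}{dx^2} \tag{21}$$

$$-\mu + 2L\phi + \Lambda\eta + \frac{1}{2}\eta^2 = g_\phi\frac{d^2\phi}{dx^2} \tag{22}$$

These are the same equations that were considered by Halperin and Pincus in the context of polymer adsorption at
liquid-liquid interfaces [28].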

Instead of solving (21)-(22) numerically, we recall that for large values of $L$ we can treat the protein area fraction
as a small parameter. As a zeroth-order approximation, we neglect the terms depending on $\phi$ in (21) and obtain as
a solution the lipid order parameter profile $\eta_0(x)$ in the absence of proteins. This profile is then inserted into (22),
yielding the protein profile $\phi(x)$. The validity of this approach, namely solving the equation (21) while neglecting
the coupling between $\eta$ and $\phi$ and inserting the solution into equation (22), is critically examined in Appendix B.
There, it is found that this approximation indeed corresponds to the first term in an expansion, in which the protein
concentration functions as the expansion parameter and which therefore is valid for small protein concentrations.

To proceed, setting $\phi = 0$ in (21) leads to

$$\eta_0(x) = \eta_\infty\tanh(x/\xi_\eta) \tag{23}$$

with the definitions

$$\eta_\infty = \sqrt{6J} \tag{24}$$

$$\xi_\eta = \sqrt{g_\eta/J} \tag{25}$$

This is the solution of the usual 4th order Ginzburg-Landau free energy expansion and is strictly valid here only for
the pure lipid. The lipid order parameter varies between $+\eta_\infty$ for $x \to \infty$ and $-\eta_\infty$ for $x \to -\infty$, and its width is
characterized by the correlation length $\xi_\eta$. The chemical potential $\mu_\eta$ is zero in the approximation employed above.
The origin is chosen as the symmetric point between the liquid condensed phase ($x > 0$) and the liquid expanded
phase ($x < 0$). Defining a rescaled length $u = x/\xi_\eta$ and a rescaled protein density $\Phi(x) = 4L\phi(x)/(\eta_\infty)^2$, the second
differential equation (22) is reduced to

$$\Phi(u) - h(u) = b^2\frac{d^2\Phi(u)}{du^2} \tag{26}$$

with the inhomogeneous term $h(u)$ given by

$$h(u) = a - \tanh^2(u) - c\tanh(u) \tag{27}$$

The remaining rescaled parameters are

$$a = \frac{2\mu}{(\eta_\infty)^2} = \frac{\mu}{3J} \tag{28}$$

$$b = \frac{\xi_\phi}{\xi_\eta} \tag{29}$$

$$c = \frac{2\Lambda}{\eta_\infty} = \frac{\Lambda}{\sqrt{3J/2}} \tag{30}$$

The parameter $a \sim \mu$ is the rescaled chemical potential, $b$ is the relative stiffness of the lipid concentration profile
compared to the protein concentration profile, and $c \sim \Lambda$ measures the preference of the proteins for the dense ($c < 0$)
or dilute ($c > 0$) lipid domains. The correlation length of the protein distribution is defined by $\xi_\phi = \sqrt{g_\phi/2L}$.
The general solution of the second order differential equation (26) can be written as

$$\Phi(u) = A\sinh(u/b) + B\cosh(u/b) + a - \Phi_1(u)/b - c\,\Phi_2(u)/b \tag{31}$$

where the functions $\Phi_1(u)$ and $\Phi_2(u)$ are given in Appendix A. The constants $A$ and $B$ have to be determined in
accord with the boundary conditions.
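The zeroth-order step and the rescaling used in this section can be sketched in a few lines. The code below is an illustrative check, not the authors' procedure: it verifies by finite differences that a tanh kink of amplitude $\sqrt{6J}$ and width $\sqrt{g_\eta/J}$ solves the lipid equation with $\phi = 0$ and $\mu_\eta = 0$, and it maps the physical parameters to the rescaled set $(a, b, c)$. The function names, the dictionary layout, and the sample parameter values are assumptions made for the example.

```python
import math


def rescaled_parameters(J, L, lam, mu, g_eta, g_phi):
    """Map (J, L, Lambda, mu, g_eta, g_phi) to the rescaled parameters."""
    eta_inf = math.sqrt(6.0 * J)           # bulk lipid order parameter
    xi_eta = math.sqrt(g_eta / J)          # lipid correlation length
    xi_phi = math.sqrt(g_phi / (2.0 * L))  # protein correlation length
    return {
        "eta_inf": eta_inf,
        "xi_eta": xi_eta,
        "xi_phi": xi_phi,
        "a": 2.0 * mu / eta_inf ** 2,
        "b": xi_phi / xi_eta,
        "c": 2.0 * lam / eta_inf,
    }


def kink_residual(x, J, g_eta, step=1.0e-3):
    """Residual of -2 J eta + eta^3 / 3 = g_eta * eta'' for the tanh kink.

    The second derivative is approximated by a central difference.
    """
    eta_inf = math.sqrt(6.0 * J)
    xi_eta = math.sqrt(g_eta / J)

    def eta(y):
        return eta_inf * math.tanh(y / xi_eta)

    second = (eta(x + step) - 2.0 * eta(x) + eta(x - step)) / step ** 2
    return -2.0 * J * eta(x) + eta(x) ** 3 / 3.0 - g_eta * second


if __name__ == "__main__":
    params = rescaled_parameters(J=0.1, L=10.0, lam=0.2, mu=0.3,
                                 g_eta=1.0, g_phi=1.0)
    print(params)
    print(max(abs(kink_residual(x, 0.1, 1.0)) for x in range(-5, 6)))
```

With $J = 1/10$ and $L = 10$, equal stiffness coefficients give a small ratio $b$, since the protein correlation length scales as $1/\sqrt{2L}$.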

V. PROTEIN DISTRIBUTION 

A. Solution for the case b = 0

It is instructive to treat first the limiting case where the stiffness of the protein distribution vanishes, i.e., $g_\phi = 0$
and $\xi_\phi = 0$. Then, one has $b = 0$ and the solution of (26) is trivially given by $\Phi(u) = h(u)$. This leads to the protein
distribution

$$\Phi^0(u) = \begin{cases} h(u) & \text{for } h(u) > 0 \\ 0 & \text{for } h(u) < 0 \end{cases} \tag{32}$$

where the restriction to a finite range in $u$ follows since $\Phi^0(u)$ has to be positive. In fact, for $b = 0$, only for $h(u) > 0$
is the protein distribution correctly described by the differential equation (26); inspection of the free energy density
$\mathcal{L}$ in the limit $\alpha \to \infty$ shows that the value of $\Phi(u)$ which minimizes $\mathcal{L}$ for $h(u) < 0$ is given by $\Phi(u) = 0$. This failure
of the variational methods used in deriving (26) is due to the fact that one requires $\Phi(u)$ to be positive in the limit
of very large proteins, $\alpha \to \infty$.

Hereafter, we choose $c > 0$ with no loss of generality, since the problem defined by (26) and (27) is symmetric under
a simultaneous inversion of $c$ and $u$ ($c \to -c$ and $u \to -u$). Using the asymptotic behavior of $h(u)$,

$$h(u) \to \begin{cases} a - 1 + c & \text{for } u \to -\infty \\ a - 1 - c & \text{for } u \to +\infty \end{cases} \tag{33}$$

the following classification emerges: (i) For $a < 1 - c$, the protein distribution vanishes both for positive and negative
values of $u$ at a sufficiently large but finite distance from the interface (which is located at $u = 0$); one actually obtains
a nonvanishing, localized distribution of proteins provided that $h(u) > 0$ for some range of $u$, but this cannot be seen
from the bulk behavior; (ii) for $1 - c < a < 1 + c$, the distribution is semi-localized and vanishes only for sufficiently
large positive values of $u$ and stays finite as $u \to -\infty$; and (iii) for $a > 1 + c$ the distribution is delocalized and stays
finite in both limits $u \to \pm\infty$. These three regimes are in accord with the phase diagram obtained in Sect. III and
Fig. 1 for finite $\alpha$ and in the $\alpha \to \infty$ limit.

An additional observation can be made for $c < 2$, where $h(u)$ has one maximum located at

$$u_{\max} = -\tanh^{-1}(c/2) \tag{34}$$

with a height

$$h(u_{\max}) = a + c^2/4 \tag{35}$$

(in the limit $c \to 2$ one obtains $u_{\max} \simeq \log(2 - c)/2$). Consequently, for $c < 2$, the line defined by $a = -c^2/4$ marks
the border between a fourth regime where the protein distribution vanishes identically (for $a < -c^2/4$) and the regime
where this distribution is non-zero (for a finite distance from the boundary between dense and dilute lipid regions).
Fig. 2 summarizes these borderlines in a phase diagram, which is in fact valid also for $b \neq 0$, as will be discussed in
the next subsection. The localized regime is shaded in gray and ends at a special point S, at which the maximum of
the protein distribution is at infinity; as pointed out before, there is an overall symmetry around the $a$-axis ($c = 0$).
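The classification above can be summarized in a small helper. This is a minimal sketch under the stated $b = 0$ picture; the function names and the string labels are illustrative choices, and boundary cases where $a$ lies exactly on a borderline are assigned arbitrarily to the less extended regime. The second helper, which returns the interval on which $h(u) > 0$ in the localized regime, follows from solving the quadratic $t^2 + ct - a = 0$ in $t = \tanh u$ and is an addition for illustration, not a formula quoted from the text.

```python
import math


def h(u, a, c):
    """Inhomogeneous term h(u) = a - tanh^2(u) - c tanh(u)."""
    t = math.tanh(u)
    return a - t * t - c * t


def classify_regime(a, c):
    """Regime of the protein distribution for rescaled parameters (a, c).

    Uses the symmetry c -> -c, u -> -u, so only |c| matters for the label.
    """
    c = abs(c)
    if a > 1.0 + c:
        return "delocalized"
    if a > 1.0 - c:
        return "semi-localized"
    if c < 2.0 and a > -c * c / 4.0:
        return "localized"
    return "no proteins"


def localized_support(a, c):
    """Interval (u1, u2) on which h(u) > 0 in the localized regime.

    Returns None outside the localized regime. The endpoints are the
    roots of t^2 + c t - a = 0 with t = tanh(u).
    """
    if classify_regime(a, c) != "localized":
        return None
    root = math.sqrt(c * c + 4.0 * a)
    t_low = (-c - root) / 2.0
    t_high = (-c + root) / 2.0
    return math.atanh(t_low), math.atanh(t_high)


if __name__ == "__main__":
    for a, c in [(2.0, 0.5), (0.8, 0.5), (0.0, 0.5), (-0.5, 0.5), (-2.5, 3.0)]:
        print(a, c, classify_regime(a, c), localized_support(a, c))
```

For $c \geq 2$ the localized regime disappears, in line with the special point S at which the lines $a = -c^2/4$ and $a = 1 - c$ meet.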

The effective correlation length $\xi_{\rm eff}$ for the proteins in the localized regime can be estimated from the curvature
of $h(u)$ at the maximum $u_{\max}$,

$$\xi_{\rm eff}^{-2} = -h''(u_{\max}) = 2(1 - c^2/4)^2 \tag{36}$$

This length diverges as one approaches the special point S, where the distribution becomes indefinitely broad.
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B. Solution for b > - 



general considerations 



On physical grounds, the solution for non-zero 6, i.e., for a finite stiffness of the protein distribution, has to coincide 
with the solution found for b = in the preceding section very far from the interface located at u = 0. This leads to 
the general boundary condition 



where $°(it) is given by (32) and the general solution is defined to be the concentration profile which minimizes 
the free energy functional (14). In the following, we discuss the properties of the general solution <j>(u) separately for 
the four regions distinguished in Fig. 2. 

i) In the delocalized case, the boundary conditions (37) occurring at infinity together with the differential equation 
(26) valid for the entire (—00,00) range in u are sufficient to determine the distribution $(tt). 

For the other cases, the boundary conditions (37) have to be supplemented by additional conditions at finite values 
of u; the distribution 3>(u) is described by (26) only in a finite interval of u. 

ii) In the case where h{u) < for all u, it follows from the requirement <&(w) > that — h{u) > and thus all 
possible solutions of (26) have strictly positive curvature as can be seen by looking at (26). The boundary conditions 
(37), which imply that $(u) = as u — > ±00, can not be satisfied for any non- vanishing solution of (26). Consequently, 
the protein distribution which minimizes the free energy is given identically by $(w) = 0. This vanishing solution was 
also found for b = 0. 

iii) When h(u) is positive in some finite interval of u but negative for u — > ±00, all solutions of (26) which are 
positive definite everywhere have positive curvature for u — > ±00 and are not compatible with the boundary conditions 
as given by (37). This merely reflects the fact that (26) describes the distribution <I>(u) only in the finite interval 
u\ < u < u 2l in which $(w) > 0. The same was found to be true for b = in the last section. From (37) in 
combination with (32), $(it) has to vanish for u — > ±00, and can be positive for finite u. As follows from minimizing 
the free energy functional 7 (14), the solution has to be smooth everywhere and thus fulfills <&(w) = $'(w) = 
at the two boundaries u = u\ and u = u 2 . 

Now the following statements can be made: a) There have to be intervals of u where <fr(u) has negative curvature 
in order to fulfill the boundary conditions 4>(u) = at u = u\ and u — u 2 ; b) close to the boundaries u = u\ and 
u = u 2 , the curvature has to be positive in order to fulfill $'(«) = at u = u\ and u = u 2 ; c) consequently, the 
solution <i>(w) crosses h(u) at two values of u inside the region bounded by u — u\ and u = u 2l at which the curvature 
of $(u) vanishes; this can be seen from (26). It follows that the boundaries u\ and u 2 do not coincide, which means 
that the protein distribution does not vanish identically. We conclude that whenever the distribution <&°{u) does 
not vanish for b = 0, it is non-vanishing for any b 7^ 0. Note that it is actually possible to construct a solution $(u) 
in accord with the boundary conditions at u\ and u 2 since the general solution (31) has two adjustable parameters A 
and B. 

iv) For the semi-localized case, the boundary condition (37) applies to the solution of (26) for u — > —00 only. The 
protein distribution is non-zero in the u interval (— 00, u 2 ) and the boundary value u 2 satisfies $(u 2 ) = $'(u 2 ) = 0. 

Putting together these arguments for the different regimes, it follows that the phase diagram in Fig. 2 is valid for 
general b > 0. 



$$\Phi(u) = h(u) \quad \text{for } u \to \pm\infty \qquad (37)$$

C. Boundary conditions

In the following, we specify the boundary conditions for general $b$ for the three different cases showing non-vanishing
protein distributions.

In the delocalized regime, the boundary conditions obtained from (37), (33), and (32) are

$$\Phi(\pm\infty) = h(\pm\infty) = a - 1 \mp c \qquad (38)$$

These boundary conditions determine the coefficients $A$ and $B$ of the general solution (31).
In the semi-localized regime, one has the conditions

$$\Phi(-\infty) = h(-\infty) = a - 1 + c \qquad (39)$$

and

$$\Phi(u_2) = \Phi'(u_2) = 0 \qquad (40)$$
which determine the position of the boundary value, $u_2$, and the coefficients $A$ and $B$.
In the localized regime, one has

$$\Phi(u_1) = \Phi'(u_1) = 0 \qquad (41)$$

$$\Phi(u_2) = \Phi'(u_2) = 0 \qquad (42)$$

Here, the boundary conditions determine $u_1$, $u_2$, $A$, and $B$. In what follows, we always assume that $u_1 < u_2$, without
loss of generality.

In the following, we present explicit protein profiles $\Phi(u)$ for the limiting cases $b = 0$ and $b = 1$. The latter value
corresponds to the case where the correlation lengths of the lipid and protein concentration profiles are equal, $\xi_\eta = \xi_\phi$.
Also, for $b = 1$, the general solution of the protein profile as given in Appendix A can be written in a simpler analytical
form.

Delocalized case: for $b = 1$, the coefficients are determined to be $B = \pi/2 - 2$ and $A = c(1 - \pi/2)$; the protein
distribution, given by (31), then reads

$$\Phi(u) = a - 2 + 2\tan^{-1}[\tanh(u/2)]\,(c\cosh u - \sinh u) + \pi(\cosh u - c\sinh u)/2 \qquad (43)$$

Using the equalities

$$\tan^{-1}[\tanh(u/2)] = \tan^{-1}[e^{u}] - \pi/4 = \pi/4 - \tan^{-1}[e^{-u}] \qquad (44)$$

the protein distribution can be rewritten as

$$\Phi(u) = a - 2 + \pi e^{u}(1 - c)/2 + 2\tan^{-1}[e^{u}]\,(c\cosh u - \sinh u) \qquad (45)$$

or

$$\Phi(u) = a - 2 + \pi e^{-u}(1 + c)/2 - 2\tan^{-1}[e^{-u}]\,(c\cosh u - \sinh u) \qquad (46)$$

in accord with the limiting values $\Phi(u) = a - 1 \mp c$ for $u \to \pm\infty$.
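
As a quick numerical cross-check of (43)-(46), the following Python sketch evaluates the three equivalent forms of the delocalized profile for $b = 1$ and compares them with the limiting values $a - 1 \mp c$. The function names and the parameter values used in the checks are illustrative assumptions and not part of the original analysis.

```python
import math

def phi_43(u, a, c):
    """Delocalized profile for b = 1 in the form (43)."""
    t = math.atan(math.tanh(u / 2.0))
    return (a - 2.0 + 2.0 * t * (c * math.cosh(u) - math.sinh(u))
            + math.pi * (math.cosh(u) - c * math.sinh(u)) / 2.0)

def phi_45(u, a, c):
    """Same profile in the form (45), convenient for u -> -infinity."""
    return (a - 2.0 + math.pi * math.exp(u) * (1.0 - c) / 2.0
            + 2.0 * math.atan(math.exp(u)) * (c * math.cosh(u) - math.sinh(u)))

def phi_46(u, a, c):
    """Same profile in the form (46), convenient for u -> +infinity."""
    return (a - 2.0 + math.pi * math.exp(-u) * (1.0 + c) / 2.0
            - 2.0 * math.atan(math.exp(-u)) * (c * math.cosh(u) - math.sinh(u)))

if __name__ == "__main__":
    a, c = 2.0, 0.3  # illustrative values with a - 1 - |c| > 0
    for u in (-4.0, 0.0, 4.0):
        print(u, phi_43(u, a, c), phi_45(u, a, c), phi_46(u, a, c))
```

Forms (45) and (46) are the numerically safer choices far to the left and far to the right of the LE/LC boundary, respectively, because the exponentially growing terms cancel explicitly there.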

Semi-localized case: for $b = 1$, the boundary condition at $u = -\infty$ leads to the relation $A = 2 + B + c - \pi(1 + c)/2$.
The protein distribution can be written as

$$\Phi(u) = a - 2 + e^{u}(B + 2 - c\pi/2) + 2\tan^{-1}[e^{u}]\,(c\cosh u - \sinh u) \qquad (47)$$

which indeed satisfies the boundary condition as given by (39). The coefficient $B$ and the boundary value $u_2$ are in turn
determined by the second boundary condition (40).

Localized case: for general $b$, the boundary conditions (41) and (42) can be cast in a more explicit form. Defining

$$\cosh(u/b)\,\Phi(u)/b - \sinh(u/b)\,\Phi'(u) = B/b + \rho(u) \qquad (48)$$

with

$$\rho(u) = \cosh(u/b)\left(a/b - \Phi_1(u)/b^2 - c\,\Phi_2(u)/b^2\right) + b^{-1}\sinh(u/b)\left(\Phi_1'(u) + c\,\Phi_2'(u)\right) \qquad (49)$$

and

$$\sinh(u/b)\,\Phi(u)/b - \cosh(u/b)\,\Phi'(u) = -A/b + \kappa(u) \qquad (50)$$

with

$$\kappa(u) = \sinh(u/b)\left(a/b - \Phi_1(u)/b^2 - c\,\Phi_2(u)/b^2\right) + b^{-1}\cosh(u/b)\left(\Phi_1'(u) + c\,\Phi_2'(u)\right) \qquad (51)$$

leads to the equations

$$-B/b = \rho(u_1) = \rho(u_2) \qquad (52)$$

$$A/b = \kappa(u_1) = \kappa(u_2) \qquad (53)$$
Equations (52) and (53) have to be solved simultaneously in order to determine $u_1$, $u_2$, $A$, and $B$. For the case $b = 1$, the
functions $\rho(u)$ and $\kappa(u)$ take the simpler form

$$\rho(u) = (a - 2)\cosh u + 2 + \tanh u\,\sinh u + 2c\tan^{-1}[\tanh(u/2)] - c\sinh u \qquad (54)$$

$$\kappa(u) = (a - 1)\sinh u + 2\tan^{-1}[\tanh(u/2)] + c(1 - \cosh u) \qquad (55)$$

In the remainder of this section, we present protein profiles calculated from the above equations for several values
of the three parameters $a$, $b$, and $c$. Figure 3 shows protein distributions for four different values of $a$ and for the two
simple cases $b = 0$ (solid lines) and $b = 1$ (broken lines). We set $c = 0$, so the protein profiles are symmetric about the
LE/LC boundary located at $u = 0$, where the lipid concentration profile as given by (23) has an inflection point. For
vanishing stiffness of the protein distribution ($b \to 0$), the profiles have discontinuous slopes for $a < 1$ at the points
where the protein concentration vanishes; the main effect of a non-vanishing stiffness parameter $b$ is to eliminate these
discontinuities, thereby flattening the entire concentration profile, as is clearly seen in Fig. 3.

Figure 4 shows asymmetric protein distributions for four different values of $c$ on the transition line between the
localized and the semi-localized regimes, defined by $a = 1 - c$. Again, solid lines denote results for $b = 0$ and broken
lines denote results for $b = 1$. As for the symmetric distributions shown in Fig. 3, a non-zero stiffness parameter $b$
removes the discontinuity of $\Phi'(u)$ at the boundary $u_2$ and flattens the concentration profile. As $c$ approaches the
value 2, the maximum of the distribution moves progressively away from the LE/LC boundary located at $u = 0$.
Also, the overall protein concentration rapidly decreases. In the limit $c \to 2$, the position of the maximum actually
diverges logarithmically, as follows from (34).

Figure 5 gives the localized protein distribution $\Phi(u)$ for $c = 0$ and $a = 0.5$ for six different values of $b$, where $u_2$
and $B$ have to be determined numerically from (42) applied to the general solution (31). Interestingly enough, the
boundary values $u_2 = -u_1$ do not diverge as $b \to \infty$ but approach finite values $u_{1,2} = \mp 1.915$. As the stiffness of the
protein distribution increases, the concentration is flattened and the area under the curves decreases, but the profile
does not spread out indefinitely and stays localized.



VI. THE PROTEIN EXCESS 



The protein excess is the total amount of adsorbed proteins. In the localized regime, this quantity is defined as

$$\Gamma = \int_{-\infty}^{+\infty} \Phi(u)\,du = \int_{u_1}^{u_2} \Phi(u)\,du \qquad (56)$$

In the delocalized and the semi-localized regimes, the quantity $\Gamma$ as defined above diverges since the protein distribution
approaches a constant non-vanishing value as $u \to -\infty$ (for the delocalized case the same is also true as $u \to \infty$). One
can still extract a meaningful quantity defined by the excess amount of protein adsorbed by subtracting the protein
concentration at $u = \pm\infty$, where $\Phi(\pm\infty) = a - 1 \mp c$. For $-2 < c < 2$ the protein distribution has one maximum,
and we define the protein excess as

$$\Gamma = \int_{-\infty}^{u_{\max}} \left(\Phi(u) - \Phi(-\infty)\right)du + \int_{u_{\max}}^{\infty} \left(\Phi(u) - \Phi(\infty)\right)du \qquad (57)$$

where $u_{\max}$ is the value of $u$ for which $\Phi(u)$ reaches its maximum.



A. Protein excess for $b = 0$

The protein excess $\Gamma$ can be calculated for $b = 0$ in closed form for all parameter values. With $\Phi(u) = h(u) =
a - \tanh^2(u) - c\tanh(u)$, the excess can be written as

$$\Gamma = \int_{u_1}^{u_2} \left(a - \tanh^2(u) - c\tanh(u)\right)du \qquad (58)$$

where the integration boundaries are given by

$$u_{1,2} = \tanh^{-1}\left[-\frac{c}{2} \mp \sqrt{\frac{c^2}{4} + a}\,\right] \qquad (59)$$
For the symmetric case, $c = 0$, the boundaries $u_1$ and $u_2$ have the values

$$u_{1,2} = \mp\tanh^{-1}\sqrt{a} \qquad (60)$$

and on the transition line between the localized and the semi-localized regimes, given by $a = 1 - c$, one obtains
$u_1 = -\infty$ and

$$u_2 = \tanh^{-1}(1 - c) \qquad (61)$$

For general $a$ and $c$, the integral (58) yields

$$\Gamma = \sqrt{c^2 + 4a} + (a - 1 + c)\tanh^{-1}\left[\frac{\sqrt{c^2 + 4a}}{1 + a}\right] - c\tanh^{-1}\left[\frac{\sqrt{c^2 + 4a}\,(2 + c)}{2 + 2a + 2c + c^2}\right] \qquad (62)$$

In the symmetric case, $c = 0$, this expression reduces to

$$\Gamma = (a - 1)\tanh^{-1}\left[\frac{\sqrt{4a}}{1 + a}\right] + \sqrt{4a} \qquad (63)$$

and on the localized to semi-localized transition line, $a = 1 - c$, it reduces to

$$\Gamma = 2 - c - c\tanh^{-1}\left[\frac{4 - c^2}{4 + c^2}\right] = 2 - c - c\log(2/c) \qquad (64)$$



Lines of constant $\Gamma$ for $b = 0$ calculated from (62) are shown as broken lines in Fig. 2. Those lines can be helpful
in interpreting experimental findings when only the integrated protein amount is known and not the entire profile.
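
The closed-form excess for $b = 0$ can be checked against a direct numerical integration of (58). The sketch below is an illustration only: it assumes the localized regime $0 < a < 1 - |c|$, uses the closed form $\sqrt{c^2+4a} + (a-1+c)\tanh^{-1}[\sqrt{c^2+4a}/(1+a)] - c\tanh^{-1}[\sqrt{c^2+4a}\,(2+c)/(2+2a+2c+c^2)]$ for (62), and applies a composite Simpson rule; all function names and parameter values are hypothetical.

```python
import math

def h(u, a, c):
    """h(u) = a - tanh^2(u) - c*tanh(u), which equals Phi(u) for b = 0."""
    t = math.tanh(u)
    return a - t * t - c * t

def boundaries_b0(a, c):
    """Zeros u_1 < u_2 of h(u) as in (59); assumes 0 < a < 1 - |c|."""
    root = math.sqrt(c * c / 4.0 + a)
    return math.atanh(-c / 2.0 - root), math.atanh(-c / 2.0 + root)

def excess_closed_form(a, c):
    """Closed-form protein excess for b = 0, cf. (62)."""
    s = math.sqrt(c * c + 4.0 * a)
    return (s
            + (a - 1.0 + c) * math.atanh(s / (1.0 + a))
            - c * math.atanh(s * (2.0 + c) / (2.0 + 2.0 * a + 2.0 * c + c * c)))

def excess_numeric(a, c, n=2000):
    """Composite Simpson rule for the integral (58); n must be even."""
    u1, u2 = boundaries_b0(a, c)
    step = (u2 - u1) / n
    total = h(u1, a, c) + h(u2, a, c)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * h(u1 + k * step, a, c)
    return total * step / 3.0

if __name__ == "__main__":
    print(excess_closed_form(0.3, 0.5), excess_numeric(0.3, 0.5))
```

Exactly on the transition line $a = 1 - c$ the first inverse hyperbolic tangent diverges while its prefactor vanishes, so in floating-point arithmetic it is safer to evaluate (64) directly there.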



B. Protein excess for $b = 1$

For the symmetric case ($c = 0$) the excess is given by the closed-form expression

$$\Gamma = 2(a - 1)u_2 + 2\tanh u_2 \qquad (65)$$

with the boundary value $u_2$ determined by

$$\tan^{-1}[\tanh(u_2/2)] = \frac{1 - a}{2}\sinh u_2 \qquad (66)$$

as follows from (53) and (55) and noting that $A = 0$.

For the localized to semi-localized transition line, $a = 1 - c$, the excess is given by

$$\Gamma = 1 - c\log(2) + \tanh u_2 - u_2 c - c\log(\cosh u_2) \qquad (67)$$

with the boundary value $u_2$ determined by

$$2c + 1 = \tanh u_2 + (1 + c)(\cosh u_2 - \sinh u_2)\left(\pi/2 + 2\tan^{-1}[\tanh(u_2/2)]\right) \qquad (68)$$

as follows from applying (42) to (47).
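
For the symmetric case, the boundary value $u_2$ at $b = 1$ has no closed form, but it is the positive root of $\kappa(u)$ from (55) with $c = 0$ and is easily found by bisection. The following sketch (an illustration under the assumption $0 < a < 1$; function names and tolerances are arbitrary choices) computes $u_2$ and the excess (65), and compares it with the $b = 0$ value obtained from $u_2 = \tanh^{-1}\sqrt{a}$.

```python
import math

def kappa_sym(u, a):
    """kappa(u) of (55) for c = 0; its positive root is the boundary u_2 at b = 1."""
    return (a - 1.0) * math.sinh(u) + 2.0 * math.atan(math.tanh(u / 2.0))

def boundary_u2_b1(a, lo=1e-6, hi=40.0, tol=1e-12):
    """Bisection for the positive root of kappa_sym; assumes 0 < a < 1."""
    if not 0.0 < a < 1.0:
        raise ValueError("the localized symmetric case requires 0 < a < 1")
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if kappa_sym(mid, a) > 0.0:
            lo = mid   # root lies to the right of mid
        else:
            hi = mid   # root lies to the left of mid
        if hi - lo < tol:
            break
    return 0.5 * (lo + hi)

def excess_b1(a):
    """Protein excess (65) for b = 1, c = 0."""
    u2 = boundary_u2_b1(a)
    return 2.0 * (a - 1.0) * u2 + 2.0 * math.tanh(u2)

def excess_b0(a):
    """Protein excess for b = 0, c = 0, with u_2 = atanh(sqrt(a)) from (60)."""
    return 2.0 * (a - 1.0) * math.atanh(math.sqrt(a)) + 2.0 * math.sqrt(a)

if __name__ == "__main__":
    for a in (0.25, 0.5, 0.75):
        print(a, boundary_u2_b1(a), excess_b1(a), excess_b0(a))
```

The same bisection pattern could be applied to (68) on the transition line, with the bracket chosen so that the two sides of (68) change order inside it.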

The protein excess $\Gamma$ for the symmetric case $c = 0$ is shown in Fig. 6(a) as a function of $a$, where the solid line
denotes results for $b = 0$ and the broken line for $b = 1$. These results correspond to the concentration profiles plotted
in Fig. 3 and are given by (63) and (65). The protein excess for $b = 1$ is smaller than for $b = 0$, which is also visible
in Fig. 3. The overall flattening of the distributions for non-zero $b$ causes the area under the distribution to decrease.
For $a = 1$, the protein excess is given by $\Gamma = 2$ for both values of $b$. The same value holds for general $b$, as can be
demonstrated by numerical solutions of (56). The boundary values $u_2$ given by (60) for $b = 0$ and determined by (66)
for $b = 1$ are plotted in Fig. 7(a).

In Fig. 6(b) we show the protein excess on the localized/semi-localized transition line, $a = 1 - c$, as a function of
$c$, as given by (64) and (67). As in the symmetric case, the protein excess $\Gamma$ decreases as $b$ becomes non-zero. The
boundary values $u_2$, obtained from (61) and (68), for $b = 0$ and $b = 1$, respectively, are plotted in Fig. 7(b).
VII. LE/LC LINE TENSION IN THE PRESENCE OF PROTEIN



First we calculate the line tension of the liquid expanded-condensed interface in the absence of proteins, denoted
by $\tau_0$. This energy per unit length follows from the total free energy density $\gamma$ as given by (14) after subtraction of
the bulk free energy density infinitely far from the interface and can be defined by



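$$\tau_0 = 2\int_0^\infty dx\,\left\{\frac{g_\eta}{2}\left(\frac{d\eta_0}{dx}\right)^2 - J\eta_0^2(x) + \frac{\eta_0^4(x)}{12} + 3J^2\right\} \qquad (69)$$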

recalling that

$$\eta_0(x) = \eta_\infty\tanh(x/\xi_\eta) \qquad (70)$$

and $\eta_\infty = \sqrt{6J}$ and $\xi_\eta = \sqrt{g_\eta/J}$. In writing (69) we used the symmetry around $x = 0$. The integral (69) is elementary
and gives the standard result

$$\tau_0 = 8J^{3/2}g_\eta^{1/2} \qquad (71)$$

The total line tension is given by $\tau = \tau_0 + \tau_\phi$. The line tension contribution due to adsorbed proteins in the
localized phase region can be written as

$$\tau_\phi = \int dx\,\left\{L\phi^2(x) - \mu\phi(x) + \lambda\eta_0(x)\phi(x) + \tfrac{1}{2}\eta_0^2(x)\phi(x) + \tfrac{1}{2}g_\phi\left(\frac{d\phi}{dx}\right)^2\right\} \qquad (72)$$

For simplicity, we will restrict ourselves to the limit $b \to 0$, because the integration in (72) can then be done in
closed form. The line tension contribution can be expressed in reduced variables as

$$\tau_\phi = \frac{9J^{3/2}g_\eta^{1/2}}{2L}\int du\,\left\{\tfrac{1}{2}\Phi^2(u) - a\Phi(u) + c\tanh u\,\Phi(u) + \tanh^2 u\,\Phi(u)\right\} \qquad (73)$$

Using the solution found for $b = 0$ (or, equivalently, $\xi_\phi = 0$), $\Phi(u) = a - \tanh^2 u - c\tanh u$, the integral can be solved
for general $a$ and $c$. Here, we only present the solution for the symmetric case, $c = 0$, which is given by

$$\tau_\phi = \frac{9J^{3/2}g_\eta^{1/2}}{2L}\left\{\sqrt{a}\,(1 - 5a/3) - (1 - a)^2\tanh^{-1}(\sqrt{a})\right\} \qquad (74)$$



The limiting values are $\tau_\phi = 0$ for $a = 0$, since in this case no proteins are adsorbed, and $\tau_\phi = -3J^{3/2}g_\eta^{1/2}/L$ for
$a = 1$, to be compared with $\tau_0 = 8J^{3/2}g_\eta^{1/2}$. This is the smallest value possible; for larger values of $a$ the line tension
contribution remains constant. The adsorption of proteins thus leads to a reduction of the total line tension
$\tau = \tau_0 + \tau_\phi$. In principle, the total line tension $\tau$ can take negative values for sufficiently large $a$ if $L < 3/8$, which
amounts to an instability of the LE-LC interface, possibly signaling a depression of the lipid phase transition. Of
course, in this limit the approximations used in deriving (26) break down, since we assumed $L$ to be large and the
proteins to be only a small perturbation on the pure lipid phase transition.
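
The symmetric-case result (74) and the sign change of the total line tension at $L = 3/8$ can be verified numerically. The sketch below is a minimal illustration: it integrates the reduced integrand of (73) with $\Phi = a - \tanh^2 u$ over $[-u_2, u_2]$ by a composite Simpson rule and compares with the bracket of (74); function names and the numerical values of $J$, $g_\eta$ and $L$ are assumptions chosen only for demonstration.

```python
import math

def tau_phi_reduced(a):
    """Bracket of (74): tau_phi in units of 9 J^(3/2) g_eta^(1/2) / (2 L), for b = 0, c = 0."""
    r = math.sqrt(a)
    return r * (1.0 - 5.0 * a / 3.0) - (1.0 - a) ** 2 * math.atanh(r)

def tau_phi_reduced_numeric(a, n=2000):
    """Simpson integration of the integrand of (73) with Phi = a - tanh^2(u), c = 0."""
    u2 = math.atanh(math.sqrt(a))

    def integrand(u):
        t2 = math.tanh(u) ** 2
        phi = a - t2
        return 0.5 * phi * phi - a * phi + t2 * phi

    step = 2.0 * u2 / n
    total = integrand(-u2) + integrand(u2)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * integrand(-u2 + k * step)
    return total * step / 3.0

def total_line_tension(a, J, g_eta, L):
    """tau = tau_0 + tau_phi, using (71) and (74)."""
    scale = J ** 1.5 * math.sqrt(g_eta)
    return 8.0 * scale + 9.0 * scale / (2.0 * L) * tau_phi_reduced(a)

if __name__ == "__main__":
    print(tau_phi_reduced(0.4), tau_phi_reduced_numeric(0.4))
    print(total_line_tension(0.999999, 1.0, 1.0, 0.3))  # negative, since L < 3/8
```

Values of $a$ are kept slightly below 1 in such checks because $\tanh^{-1}(1)$ is not finite, even though its prefactor $(1 - a)^2$ vanishes in the limit.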



VIII. PROTEIN PROFILE IN THE SUBPHASE 



Up to now the coupled protein-lipid system was considered as a purely two-dimensional system on the water/air
interface which is positioned at $z = 0$. It is possible to evaluate the influence of a finite solubility of the proteins in
the subphase on the protein distribution in the monolayer, and, in addition, to give a more precise meaning to the
protein parameters $\mu$ and $L$ used in (13). The vertical protein concentration profile in the subphase can be calculated
as a function of the distance $z$ from the monolayer. For the calculation of this profile, which is denoted by $\phi_\perp(z)$,
we neglect any variation in the horizontal direction. Assuming that the water is a good solvent for the protein (and,
therefore, that the aqueous protein solution is far from its demixing curve), we can write the free energy per unit area
on the surface as

$$\gamma_\perp = \int_0^\infty dz\,\left\{\frac{g_b}{2}\left(\frac{d\phi_\perp}{dz}\right)^2 + L_b\phi_\perp^2(z) - \mu_b\phi_\perp(z)\right\} + L_s\phi^2 - \mu_s\phi \qquad (75)$$
where $\phi = \phi_\perp(0)$ is the protein concentration at the surface (or, equivalently, in the monolayer). This expression
is very similar to free energy functionals studied in the context of wetting and other surface phenomena [30]. In
analogy to the parameters used in (13) and (15), $g_b$, $L_b$, and $\mu_b$ are the protein parameters in the "bulk" subphase,
and $L_s$ and $\mu_s$ are the bare protein parameters at the "surface" (or in the monolayer). The chemical potential $\mu_s$
measures the free energy difference between a protein molecule in the subphase and in the monolayer, and it contains
contributions due to van der Waals interactions of the protein with its surrounding media as well as hydrophobic
contributions coming from structural changes of the protein at the surface. It is believed that proteins unfold their
hydrophobic parts when they are inside a monolayer or even at the free air-water interface. The energy gained by
such conformational transformations can be extremely high.

The Euler-Lagrange equation for the bulk density profile takes the form

$$2L_b\phi_\perp(z) - \mu_b = g_b\frac{d^2\phi_\perp(z)}{dz^2} \qquad (76)$$

The bulk protein concentration infinitely far from the monolayer is given from (76) by

$$\phi_b = \phi_\perp(\infty) = \frac{\mu_b}{2L_b} \qquad (77)$$

When the protein adsorbing on the surface is in contact with a large bulk reservoir of proteins, the bulk concentration
$\phi_b$ can be regarded as a fixed parameter and the chemical potential $\mu_b$ acts as a Lagrange multiplier satisfying the
relation $\mu_b = 2L_b\phi_b$.

The solution of (76) compatible with the requirement $\phi_\perp(\infty) = \phi_b$ is given by

$$\phi_\perp(z) = (\phi - \phi_b)e^{-z/\xi_b} + \phi_b \qquad (78)$$

where the correlation length of the protein distribution in the subphase is $\xi_b = \sqrt{g_b/2L_b}$ and $\phi = \phi_\perp(0)$ is the surface value.
The surface free energy, which is the total free energy due to the presence of the monolayer at $z = 0$, can be expressed
as

$$\Delta\gamma_\perp = \gamma_\perp - \int_0^\infty dz\,\{L_b\phi_b^2 - \mu_b\phi_b\} \qquad (79)$$

For the density profile given by (78), it takes the form

$$\Delta\gamma_\perp = (\phi - \phi_b)^2\sqrt{L_b g_b/2} + L_s\phi^2 - \mu_s\phi \qquad (80)$$

Minimizing this expression with respect to the surface protein concentration $\phi$ leads to the value

$$\phi = \frac{\mu_s + \phi_b\sqrt{2L_b g_b}}{2L_s + \sqrt{2L_b g_b}} \qquad (81)$$

Effectively, the presence of a finite concentration of proteins in the subphase can be modeled by using the modified
parameters

$$\mu = \mu_s + \phi_b\sqrt{2L_b g_b} \qquad (82)$$

$$L = L_s + \sqrt{L_b g_b/2} \qquad (83)$$


for the two-dimensional description of the protein distribution in the monolayer, given in (13). With these effective
parameters, the $\phi$-dependent part of the surface free energy can be rewritten (up to a constant) as $\Delta\gamma_\perp = L\phi^2 - \mu\phi$.
For very large values of the protein adsorption free energy $\mu_s$, as observed for proteins which change their structure
considerably as they approach the air-water interface, and a rather small reservoir of proteins in the subphase, most
of the proteins will be incorporated in the monolayer, leading to a depletion in the subphase, i.e., $\phi_b \approx 0$. In this case
the total amount of protein in the monolayer is a conserved quantity and $\mu$ then acts as a Lagrange multiplier. In
the case of a large reservoir of proteins in the subphase, conservation of protein particles is taken care of primarily by
adjusting the bulk concentration $\phi_b$. In both cases, the parameter $\mu$, which appears in (13), can be tuned by changing
the total amount of protein added to the system.
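
The mapping from subphase parameters to the effective monolayer parameters (82) and (83) is simple enough to check numerically. The following sketch evaluates the surface free energy (80), the effective parameters, and the minimizing surface concentration (81), which equals $\mu/2L$. The numerical parameter values are arbitrary illustrations, not fitted or experimental quantities, and the function names are hypothetical.

```python
import math

def surface_free_energy(phi, phi_b, L_b, g_b, L_s, mu_s):
    """Surface free energy (80) as a function of the surface concentration phi."""
    return ((phi - phi_b) ** 2 * math.sqrt(L_b * g_b / 2.0)
            + L_s * phi ** 2 - mu_s * phi)

def effective_parameters(phi_b, L_b, g_b, L_s, mu_s):
    """Effective chemical potential (82) and effective L (83)."""
    mu = mu_s + phi_b * math.sqrt(2.0 * L_b * g_b)
    L = L_s + math.sqrt(L_b * g_b / 2.0)
    return mu, L

def equilibrium_surface_concentration(phi_b, L_b, g_b, L_s, mu_s):
    """Minimizer of (80); identical to (81) and to mu / (2 L)."""
    mu, L = effective_parameters(phi_b, L_b, g_b, L_s, mu_s)
    return mu / (2.0 * L)

if __name__ == "__main__":
    params = dict(phi_b=0.01, L_b=2.0, g_b=1.0, L_s=5.0, mu_s=1.0)  # illustrative only
    phi_eq = equilibrium_surface_concentration(**params)
    print(phi_eq, surface_free_energy(phi_eq, **params))
```

Writing the minimizer as $\mu/2L$ makes explicit that the subphase enters the two-dimensional description only through the two effective parameters.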
IX. EFFECT OF PROTEIN ON SURFACE PRESSURE



We discuss now how the parameters used in our calculation can be related to experimentally measurable quantities,
such as the lateral pressure $\Pi$. This will be done for the simplified cases where there are either only proteins or only
lipids at the water surface. First, we calculate the lateral pressure in the limit of small coverage with proteins or
lipids, leading to a modified ideal gas law; the correction to the limiting ideal gas behavior gives information about
the interactions.

In the case where no lipid is present at the water surface, one sets $\eta = \phi - 1$ in (7) and the free energy per lattice
site is

$$F(\phi)/T = K\phi^2 + \{\phi\log(\phi) + (1 - \phi)\log(1 - \phi)\}/\alpha \qquad (84)$$

where $\alpha$ is the area ratio of the protein and the underlying lipid/vacancy lattice, and $K = L + \lambda - J - 1/2$ is an effective
interaction parameter. For the case where only lipids are present one has to make the replacements $K \to -4(J + 1/2)$,
$\phi \to \phi_L$ and $\alpha \to 1$. The thermodynamic potential for a system covering $N$ lattice sites is defined by

$$N\Omega = NF - \mu T N\phi + \Pi N a^2 \qquad (85)$$

with $a$ being the lattice constant of the underlying lattice of vacancies or lipids. Minimizing the potential with respect
to the number of occupied lattice sites $N$ and the protein density $\phi$ leads to

$$\mu = \frac{1}{T}\left.\frac{\partial F}{\partial\phi}\right|_{\phi = \phi_{eq}} \qquad (86)$$

$$-F(\phi_{eq}) + \mu T\phi_{eq} = \Pi a^2 = \phi^2\left.\frac{\partial (F/\phi)}{\partial\phi}\right|_{\phi = \phi_{eq}} = TK\phi_{eq}^2 - T\log(1 - \phi_{eq})/\alpha \qquad (87)$$

Expanding the logarithm, one finds the behavior valid for small surface pressures

$$\Pi A = T + A^{-1}a^2\alpha T(1/2 + K\alpha) \approx T + \Pi a^2\alpha(1/2 + K\alpha) \qquad (88)$$

where $A = a^2\alpha/\phi_{eq}$ is the surface area available per protein (or lipid if one makes the replacement $\alpha = 1$). The first
term in (88) corresponds to the ideal gas behavior, the second term is an enthalpic and entropic correction from which
the effective interaction term $K$ can be deduced, if the area of a protein, $a^2\alpha$, and the protein-lipid area ratio $\alpha$ are
known.
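
The small-coverage expansion (88) can be compared with the full lattice-gas pressure (87). In the sketch below, `pi_a2_over_T` returns $\Pi a^2/T$ from (87), `pi_A_over_T` multiplies by $\alpha/\phi$ to obtain $\Pi A/T$ with $A = a^2\alpha/\phi$, and `pi_A_over_T_dilute` is the linearized form $1 + \phi(1/2 + K\alpha)$ implied by (88). The function names and the values of $K$ and $\alpha$ are illustrative assumptions.

```python
import math

def pi_a2_over_T(phi, K, alpha):
    """Pi a^2 / T from (87) for equilibrium coverage phi."""
    return K * phi ** 2 - math.log(1.0 - phi) / alpha

def pi_A_over_T(phi, K, alpha):
    """Pi A / T, with A = a^2 alpha / phi the area per protein."""
    return pi_a2_over_T(phi, K, alpha) * alpha / phi

def pi_A_over_T_dilute(phi, K, alpha):
    """Leading correction to the ideal gas law, cf. (88)."""
    return 1.0 + phi * (0.5 + K * alpha)

if __name__ == "__main__":
    K, alpha = 0.2, 10.0  # illustrative values only
    for phi in (1e-3, 1e-2, 1e-1):
        print(phi, pi_A_over_T(phi, K, alpha), pi_A_over_T_dilute(phi, K, alpha))
```

The sign of the deviation from the ideal gas value is set by $1/2 + K\alpha$, so a plot of $\Pi A$ against $\Pi$ at low pressure gives direct access to the sign and rough magnitude of $K$.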

In order to estimate the critical interaction strengths, it is useful to define a new order parameter $\theta = 2\phi - 1$
for which the free energy expression (84) can be expanded around $\theta = 0$ and then reads (to fourth order in $\theta$ and
neglecting terms linear in $\theta$)

$$F/T = \left(\frac{K}{4} + \frac{1}{2\alpha}\right)\theta^2 + \frac{\theta^4}{12\alpha} \qquad (89)$$

from which the critical point of demixing is deduced to be $K^* = -2/\alpha$. Note that for the case of lipids one recovers the
$\phi$-independent part of (13). From (89) one sees that for interaction parameters $K > K^*/4$ the sign of the correction in
(88) is positive, vanishes for $K = K^*/4$ and actually becomes negative as the interaction approaches the critical value.
Measurements of $\Pi A$ as a function of $\Pi$ for the hydrophobic polypeptide cyclosporin A in the relevant temperature
range indeed showed positive slopes [21], indicative of an interaction parameter $K$ far above the critical value. The
sign of the parameter $\lambda$ indicates the preference for the protein to enter dense ($\lambda < 0$) or dilute lipid regions ($\lambda > 0$);
experiments indicate that this parameter is close to zero, so that $L$ is larger than zero. Neglecting higher-order terms
in $\phi$ in the free energy expression (13) thus seems justified, assuming that cyclosporin A is a typical protein.

Next we show how the effective parameter $K$ can be related to properties of an adsorbed layer of proteins or lipids
close to their critical points. From (89), the thermodynamic potential is given by

$$N\Omega = NT\left(\frac{K}{4} + \frac{1}{2\alpha}\right)\theta^2 + \frac{NT}{12\alpha}\theta^4 - NT\mu\theta + N\Pi a^2 \qquad (90)$$

Above the critical point of demixing, defined by the critical interaction strength $K^* = -2/\alpha$, one can neglect the
fourth-order term and obtains upon variation with respect to $N$ and $\theta$ a relation between $K$ and the equilibrium
pressure $\Pi_{eq}$ and equilibrium coverage $\theta_{eq}$ given by
where $\Pi^*$ and $T^*$ are the pressure and the temperature at the critical point, and thus material constants of the protein
(or the lipid).

Below the critical point of demixing and in the coexistence region one has to keep the fourth order term and obtains 
the analogous relation 



$$\left(\frac{K}{4} + \frac{1}{2\alpha}\right)^2 = \frac{a^2}{3\alpha}\left(\frac{\Pi_{coex}}{T} - \frac{\Pi^*}{T^*}\right) \qquad (92)$$



where $\Pi_{coex}$ denotes the pressure at coexistence at a given temperature $T$.

For fitting experimental data to the above expression it is important to note that the interaction parameters $J$, $L$,
and $\lambda$ as defined by (9)-(11) depend on the temperature.



X. DISCUSSION 



We studied a simple model which explains possible aggregation of proteins or other large macromolecules at the 
boundary between coexisting liquid condensed and liquid expanded domains of lipids. Such a preferential adsorption 
of proteins has been observed experimentally plfl . Based on the general phase diagram, shown in Fig. 2, obtained in 
the limit of proteins with large areas compared with lipids ($\alpha \to \infty$), we predict a transition from protein distributions
localized at the LE/LC boundary to semi-localized and delocalized distributions, for which the protein concentrations
remain finite in the coexisting lipid phases. Such a transition can be observed by either changing the total amount
of adsorbed proteins (corresponding to a change in $a$), or by changing the temperature (influencing the parameter $c$).
We also calculated various experimentally accessible quantities, such as the protein excess $\Gamma$ and the line tension $\tau$.
The line tension is predicted to decrease upon adsorption of proteins.

The mechanism leading to the preferential adsorption of proteins at the one-dimensional boundary line between 
LE and LC phases is due to a competition of the different contributions to the entropy of mixing of the three 
components: proteins, lipids, and vacancies. We recall that vacancies are artificially introduced just to allow the 
Langmuir monolayer to be compressible. Our model assumes that the protein actually penetrates into the monolayer. 
A partial intrusion is also possible and can be described by the model, if the proteins take up at least some area 
at the air-water interface. Other mechanisms based on long-ranged interactions such as electrostatic forces are also 
important and could lead to similar results. 

The affinity of the proteins to the LE/LC boundary can also originate from other enthalpic reasons: If the protein 
itself has amphiphilic properties with respect to the density of the surrounding medium, i.e., if one moiety of the 
protein favors a denser environment while the other moiety favors a more dilute environment, it would be driven into 
the interface between the LE and LC phases. However, such an amphiphilic property of the proteins seems to be 
unlikely, and, if present, too weak to produce the effects observed in experiments. 

Finally, we mention that similar effects should be observable for freely suspended multicomponent membranes which
show phase separation into coexisting domains with different lipid compositions [^,^5 26 1. Here, integral membrane 
proteins should be either dissolved in one of the domains, depending on the enthalpic preference, or, if this preference 
is very weak, enriched and localized at the one-dimensional boundary line between the domains. 
APPENDIX A: SOLUTION OF DIFFERENTIAL EQUATION

Here we derive the solution of the differential equation

$$\Phi(u) - h(u) = b^2\frac{d^2\Phi(u)}{du^2}$$

with the inhomogeneous term $h(u)$ given by

$$h(u) = a - \tanh^2 u - c\tanh u$$

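Denoting by $\Phi_a$ and $\Phi_b$ two independent solutions of the homogeneous differential equation $\Phi(u) = b^2\,d^2\Phi(u)/du^2$, a
particular solution of the inhomogeneous differential equation is formally given by

$$\Phi_p(u) = \frac{1}{b^2}\int^u dw\,h(w)\,\frac{\Phi_a(w)\Phi_b(u) - \Phi_a(u)\Phi_b(w)}{\Phi_a'(w)\Phi_b(w) - \Phi_a(w)\Phi_b'(w)}$$

Choosing $\Phi_a(u) = A\sinh(u/b)$, $\Phi_b(u) = B\cosh(u/b)$, and defining the particular solution as

$$\Phi_p(u) = a - \frac{1}{b}\Phi_1(u) - \frac{c}{b}\Phi_2(u)$$

the integrals to be solved are

$$\Phi_1(u) = \int_0^u dw\,\tanh^2(w)\,\sinh\left(\frac{w - u}{b}\right)$$

$$\Phi_2(u) = \int_0^u dw\,\tanh(w)\,\sinh\left(\frac{w - u}{b}\right)$$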

The integration is straightforward and yields 

<i) 1 (u) = b — 6cosh(u/6) + sinh(u/6) 



4b 
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$2(w) = 6 - 6cosh(u/6) 



1 1 

2 + 46 



- * 



-bF 



1 1 2 

1- • 1 • -e 

' 26' 26' 



1 

46 
where $\psi[z]$ denotes the digamma function and $F[\alpha; \beta; \gamma; z]$ denotes the hypergeometric function pl||. These
functions are defined by

$$\psi[z] = \frac{d\log\Gamma[z]}{dz}$$

with the gamma function defined as usual as

$$\Gamma[z] = \int_0^\infty t^{z-1}e^{-t}\,dt$$

and

$$F[\alpha; \beta; \gamma; z] = \frac{\Gamma[\gamma]}{\Gamma[\beta]\,\Gamma[\gamma - \beta]}\int_0^1 t^{\beta - 1}(1 - t)^{\gamma - \beta - 1}(1 - tz)^{-\alpha}\,dt$$



For the special case $b = 1$, the above expressions simplify and can be expressed as

$$\Phi_1(u) = 2 - 2\cosh(u) + 2\tan^{-1}[\tanh(u/2)]\sinh(u) \qquad (A12)$$

$$\Phi_2(u) = \sinh(u) - 2\tan^{-1}[\tanh(u/2)]\cosh(u) \qquad (A13)$$

The general solution of the differential equation is given by

$$\Phi(u) = A\sinh(u/b) + B\cosh(u/b) + a - \Phi_1(u)/b - c\,\Phi_2(u)/b \qquad (A14)$$

where the constants $A$ and $B$ are determined from the boundary conditions (see text).
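
For $b = 1$ the closed forms (A12) and (A13) make it easy to confirm numerically that the general solution (A14) satisfies $\Phi(u) - h(u) = \Phi''(u)$ for any choice of $A$ and $B$. The sketch below does this with a central finite difference for the second derivative; the step size, parameter values and function names are illustrative assumptions, and the residual is only expected to vanish up to the discretization error.

```python
import math

def h(u, a, c):
    """Inhomogeneous term h(u) = a - tanh^2(u) - c*tanh(u)."""
    t = math.tanh(u)
    return a - t * t - c * t

def phi_general_b1(u, a, c, A, B):
    """General solution (A14) at b = 1, using (A12) and (A13)."""
    t = math.atan(math.tanh(u / 2.0))
    phi1 = 2.0 - 2.0 * math.cosh(u) + 2.0 * t * math.sinh(u)
    phi2 = math.sinh(u) - 2.0 * t * math.cosh(u)
    return A * math.sinh(u) + B * math.cosh(u) + a - phi1 - c * phi2

def ode_residual(u, a, c, A, B, step=1e-3):
    """Phi - h - Phi'' with Phi'' from a central difference; should be close to zero."""
    f = lambda x: phi_general_b1(x, a, c, A, B)
    second = (f(u + step) - 2.0 * f(u) + f(u - step)) / (step * step)
    return f(u) - h(u, a, c) - second

if __name__ == "__main__":
    for u in (-2.0, 0.0, 1.5):
        print(u, ode_residual(u, a=0.5, c=0.3, A=0.2, B=-0.4))
```

The same check could be repeated for other $b$ once $\Phi_1$ and $\Phi_2$ are evaluated by numerical quadrature of their defining integrals.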

APPENDIX B: LOW PROTEIN CONCENTRATION EXPANSION 

Here we discuss the validity of the approximations leading from the Euler-Lagrange equations (21) and (22) to the
differential equation (26), namely the use in (22) of the solution $\eta_0(x)$, which was obtained by neglecting the two
terms $\lambda\phi$ and $\eta\phi$ in (21). The solution $\eta_0(x)$ of the simplified differential equation obtained by setting $\phi = 0$ in (21)
can be regarded as a zeroth-order approximation to the full solution in an expansion in powers of $\phi(x)$. The validity of
this approximation can be estimated by reconsidering the differential equation (21) and substituting for $\phi$ the solution
$\phi(x)$ which was found initially by neglecting the coupling terms between $\eta$ and $\phi$ in (21).

Consider first the second coupling $\eta\phi$ between $\phi$ and $\eta$ in (21). This term is unimportant as long as $\phi \ll J$. This
is a reasonable assumption given that the protein concentration is small and one is not too close to the critical point
of the liquid-expanded liquid-condensed lipid transition. This term will not be considered any further.

In order to estimate the effect of the other term which was neglected, $\lambda\phi$, we define

$$\eta(x) = \eta_0(x) + \delta\eta(x) \qquad (B1)$$

with $\eta_0(x)$ given by (23) and $\eta(x)$ denoting the exact solution of (21). Since $\eta_0(x)$ solves equation (21) without the
terms proportional to $\phi$, the differential equation for $\delta\eta(x)$ neglecting terms of $O(\delta\eta^2, \delta\eta\,\phi)$ is given by

$$\delta\eta(x)\left(-2J + \eta_0^2(x)\right) + \phi(x)\left(\lambda + \eta_0(x)\right) = g_\eta\,\delta\eta''(x) \qquad (B2)$$

From the differential equation (21), one sees that the correction we are estimating here is important only for

$$\lambda\phi > \left|-2J\eta_0 + \eta_0^3/3\right| \approx \left|-2J\eta_0\right| \qquad (B3)$$

The last step follows since $\eta_0(x)$ has to be much smaller than unity for the inequality to hold. This can only be true in
the close vicinity of the interface between dense and dilute lipid regions, i.e., for $x \approx 0$. Consequently, the correction
$\delta\eta(x)$ is only important around $x = 0$. Then, the terms proportional to $\eta_0(x)$ can be neglected and the differential
equation (B2) simplifies to

$$-2J\,\delta\eta(x) + \lambda\phi(x) = g_\eta\,\delta\eta''(x) \qquad (B4)$$

Replacing $\phi(x)$ by its value at the origin, $\phi(0)$, the solution of (B4) is formally written as

$$\delta\eta(x) = \frac{\lambda\phi(0)}{2J} + C\sin(\sqrt{2}\,x/\xi_\eta) + D\cos(\sqrt{2}\,x/\xi_\eta) \qquad (B5)$$

In order for the correction $\delta\eta(x)$ to vanish outside the region of interest centered around $x \approx 0$, both coefficients $C$
and $D$ have to be of the same order as the constant $\lambda\phi(0)/2J$. The magnitude of the correction is thus given by

m„?m ( B6) 

J Voo 

Note that $c$ is a parameter of order unity (or smaller) in the localized protein region (see Fig. 2). Thus, the correction
$\delta\eta$ enters in the calculation of the protein distribution $\phi(x)$ as a higher order contribution in terms of the ratio
$\phi(0)/\eta_\infty$, which is a small parameter. Neglecting this correction is a controlled approximation corresponding to
keeping only the first order in a general expansion in terms of $\phi(0)/\eta_\infty$, the ratio of the protein concentration and
the lipid concentration difference.
FIG. 1. Bulk phase diagram for $L = 10$, $J = 1/10$, and for $\alpha = 10$ (short dashes), 50 (long dashes), and 200 (long-short
dashes) as a function of the rescaled chemical potential parameter $a$ and the interaction parameter $c$. The solid lines denote
the phase boundaries for the limiting case $\alpha \to \infty$. In the delocalized-phase region the protein concentration in the dense and
dilute lipid regions is finite; in the so-called "semi-localized" region only the dilute lipid region contains proteins (for negative
values of $c$ only the dense lipid region contains proteins), and in the region denoted by "no proteins" the protein concentration is
very small ($\sim \exp[-\alpha]$) in both coexisting lipid regions. In part of the "no protein" region, the solution of the Euler-Lagrange
equations gives a new localized protein distribution in the neighborhood of the LE/LC boundary, see Fig. 2.
FIG. 2. Phase diagram for general $b$, valid in the limit $\alpha \to \infty$. The shaded area denotes the localized regime, in which the
protein distribution is localized at the boundary between dense and dilute lipid regions. The special point S is the limiting
localized case where the maximum of the protein distribution is infinitely far away from the interface between the liquid
condensed and liquid expanded phases. The broken lines denote lines of constant excess protein $\Gamma$ calculated for the special
case $b = 0$. The phase diagram is symmetric with respect to the $a$ axis ($c = 0$ line).

FIG. 3. Protein distributions for the symmetric case defined by $c = 0$ for different values of $a$; solid lines denote $b = 0$ and
broken lines denote $b = 1$.

FIG. 4. Asymmetric protein distributions at the boundary between the localized case and the semi-localized case, defined
by $a = 1 - c$; solid lines denote $b = 0$ and broken lines denote $b = 1$. The left boundary $u_1$ is located at $u_1 = -\infty$ for all values
of $b$.

FIG. 5. Protein distribution $\Phi(u)$ for $c = 0$ and $a = 0.5$ for the following values of $b$ (from top to bottom): $b = 0$, 0.2, 0.6, 1,
1.6, and 2.8. The limiting value of $u_2$ for $b \to \infty$ is given by $u_2 = 1.915$.

FIG. 6. (a) Protein excess $\Gamma$ for the symmetric case $c = 0$; the solid line denotes $b = 0$ and the broken line denotes $b = 1$. At
$a = 1$ the excess is $\Gamma = 2$ independently of the value of $b$. (b) Protein excess $\Gamma$ on the localized/semi-localized transition line,
defined by $a = 1 - c$.

FIG. 7. (a) Boundary value $u_2$ for the symmetric case $c = 0$ for $b = 0$ (solid line) as given by (60) and for $b = 1$ (broken
line) as determined by (66); note that here $u_1 = -u_2$. (b) Boundary value $u_2$ for the localized to semi-localized transition line,
defined by $a = 1 - c$, for $b = 0$ (solid line) as given by (61) and for $b = 1$ (broken line) as determined by (68); note that here
$u_1 = -\infty$.